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Comparative Schematology 

Michael S. Paterson and Carl E. Hewitt 
Project MAC, M.I.T., Cambridge, Massachusetts 


Introduction 

While we may have the intuitive idea of one programming language having greater 
power than another, or of some subset of a language being an adequate 'core* for that 
language, we find when we try to formalize this notion that there is a serious theoret¬ 
ical difficulty. This lies in the fact that even quite rudimentary languages are never¬ 
theless 'universal' in the following sense. If the language allows us to program with 
simple arithmetic or list-processing functions then any effective control structure can 
be simulated, traditionally by encoding a Turing machine computation in some way. In 
particular, a simple language with some basic arithmetic can express programs for any 
partial recursive function. Such an encoding is usually quite unnatural and impossibly 
inefficient. Thus in order to carry on a practical study of the comparative power of 
different languages we are led to banish explicit functions and deal instead with ab¬ 
stract, uninterpreted programs, or schemas . What follows is a brief report on some pre¬ 
liminary exploration in this area. 


The simplest language we shall study is a flow-chart language with which we write 
am schemas such as shown in Figure 1. When we have provided an interpretation for 
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the baste function symbols and predicate symbols involved, a program schema may be re¬ 
garded as an executable program defining a partial function. As a convenient convention 
we take the arguments of this function to be the initial values of locations l ]p l 2 , 

(up to as many as are mentioned) and the value as being the final value of L Q if this is 
defined. Other locations Mq, Mp *•*, Nq, Np ••• are used only as 'working space'. We 
may as well assume that none of these latter locations is used as an argument before 
being assigned to. 

Another language we shall use provides for the recursive definition of functions 
using conditional expressions. For example: 

f(Xp * 2 ) - if fc ^ en F(if P(x^) then x 2 else A) 

else if P(x 2 ) then true else f(x 2 > F ( x i)) 

g(xp * if P(f(Xp F(x^)) then false else g(A) 

defines two abstract functions by simultaneous recursion. Given an interpretation of the 
basic function and predicate symbols (which are the upper-case symbols in the definitions) 
such a system defines a partial function corresponding to each equation. The system is 
deemed to compute the function given by the first definition. Such a system is called 
a recursive schema . If there is only one equation it is a simple recursive schema, 
if several it is compound . 

Interpretations and Equivalence 

An interpretation . I, for program schemas and recursive schemas provides: 

(i) a domain D 

(ii) for each basic predicate symbol P, a (total) predicate 

Pj : D f true , false ] 

(iii) for each n-ary basic function symbol (n £ 0) F, a (total function) 

F r : D n -» D 

In general we use P, P f , •*• as predicate symbols and other capital letters as function 
symbols. A constant is a 0-ary function. 

The partial function defined by schema S under interpretation I is denoted by S^. 

For two partial functions u, v, we write u ^ v if, for all x, either both of u(x), v(x) 
are undefined or both are defined and u(x) * v(x). Two schemas S, S' (not necessarily 
of the same type) are (strongly) equivalent , S s S' if, for all interpretations I, 



Let be two classes of schemas. We write ^ if 

( Vs € J )(3S' € i')[S s s'] and J < J ' if « J' but not J' < J . Let be the 
class of recursive schemas and fi be the class of program schemas. 
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Theorem 1.. p < R. 

Proof , to show P < fi. involves only a routine construction which we outline below. 

(For further details see [1].) For each flowchart box b t in program schema U, consider 
the abstract partial function f^, which for given values of all the locations is com¬ 
puted as follows. Start the schema at with the locations having these values, and 
then the value of the function is the final value of Lq, if defined. It is easy to 
write a compound recursive schema which defines the f^ recursively. If b^ is the 
START box then the function corresponding to U is got from fg by simply suppressing 

some of its arguments. 

Now to show that the containment is proper, consider the following recursive 
schema V. 

V. f(x) = if P(x) then x 

else H(f(L(x)), f(R(x)) ) 

We shall show that no program schema can be equivalent to V. 

A useful notion is that of a free interpretation. In a free interpretation the 
domain is the set of all strings composed from the basic function symbols. Then, for 
example, if H is a binary function symbol and E^, are strings, 

H^E , E 2 ) «HE 1 E 2 for any free interpretation I. 

The interpretation of the predicate symbols is unconstrained. It should be clear that 
for most purposes, it is sufficient to consider only free interpretations. For example, 

for any S, S', S s S' if and only if Sj ■ for all free I. 

For the present proof (and the next one) we consider the family of free interpre¬ 
tations Cl n ) n > o w ^ ere 

P T (E) ■ true if length(E) * n 
*n 

= false otherwise. 

What is the value of V (X) where X is the empty string? 

L n 

For n * 1, the value is HLR, 
for n ** 2, it is HHLLRLHLRRR 
etc. 

We show that to compute the value for I n at least n + 1 locations are required for 
working space. It helps to present the same situation in a geometric form. We are to 
play a game of placing movable tokens on a finite binary tree, T ft . For n - 3, the tree 
T- is given in Figure 2. The rules of the game are that any token may be put on a bot¬ 
tom node at any time. If the two nodes below a given node are covered then any token 
may be put at the given node. How many tokens are needed to be able to reach the top 
node? It is easy to see that n + 1 are sufficient in general for T n , and the following 
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Figure 2. Binary tree for n » 3. 
argument establishes their necessity. 

A tree will be said to be closed at a given stage of the game if there is at least 
one token on each path from top to bottom. Initially, the tree is devoid of tokens and 
so is not closed. Finally there is a token at the top node and so it is closed. We 
concentrate our attention on the time at which the tree first becomes closed. This can 
only happen as a result of placing a token at a bottom node, closing off the last path. 
Now this path is otherwise empty of tokens and so each of the n sub-trees sprouting im¬ 
mediately off this path must be independently closed. Since at least one token is needed 
to close any tree, there are at least n+1 tokens on the tree at this time. 

The relation between a computation of a program schema and moves in this game we as¬ 
sume to be self-evident. Suppose a program scheme U is equivalent to V and has just r 
locations. This is immediately absurd because there is no way for U to compute the re¬ 
quired output for the interpretation I r . This completes the first proof. 

Program schemas are therefore unequal to the task of computing certain abstract 
functions for the simple reason that a fixed number of locations cannot compute all the 
necessary final values. This is not a very interesting reason so we show in a second 
proof of the same theorem that program schemas can fail for more subtle reasons which 
reflect the inadequacy of their control structure. The "target 1 ' schema for this proof 
computes a partial predicate and is got from V, more or less, by replacing H by the 
Boolean function and. 


Second Proof of Theorem l. Consider the recursive schema: 

W. f(x) * if P(x) then true 

else if f(L(x)) then f(R(x)) else false 

Since the value of this schema is either true or is undefined, no argument of the 
kind given above is useful. Suppose program schema U is equivalent to W and has t boxes 
and r locations, and we may suppose, without loss of generality, that the only predicate 
and function symbols occurring in U are P, L, R. A state of U under a given interpre¬ 
tation is specified by a box of U together with a value from the domain for each loca¬ 
tion. Two states, S^, S£ are (just-for-now) equivalent if the sequences of boxes for the 
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computations continuing from S^, S 2 are the same. Consider the equivalence classes of 
states of U under the interpretation I defined above. With a little thought one can 
see that the only property of the value of a location which can affect the equivalence 
class is the length of the string. Furthermore any two strings of length greater than n 
are indistinguishable by P either immediately or in their future. Thus U has at most 
t # (n + 2) r equivalence classes under I . If, during a computation of U an equivalence 


n 


class is repeated, then the computation is doomed to loop. U_ (K) is supposed to ter- 

1 n 

minate (with value true ), hence must never repeat an equivalence class, and therefore 
runs for no more than t •(n4-2) steps. If n has been chosen sufficiently large that 

2 n > t'(n + 2) r 


then U cannot have time to test each one of 


P(L L ••• L), P(R L • • • L), • • * • , P(R R••• R) 

but nevertheless halts with value true . If we slightly modify I n by letting P be false 
for some such expression which is not tested, then of course U will never notice and still 
give the value true , but the value of W is now undefined. Thus U f W and the proof is 
complete. (Of course this method of proof works equally well for schema V.) 

■ ■ I 

Linearly Recursive Schemas 

The task of effectively characterizing those recursive schemas which are "program- 
mable" is impossible. Indeed, if P G JR, is the subset of those recursive schemas 
which are equivalent to some program schema, we can neither effectively enumerate {fp nor 
R. - P*. This follows from the application of straightforward techniques which are 

described in [1] or [2] and will not be proved here. The best that we can hope for then 
is to effectively characterize large sub-classes of P and J\ - J? . At present we 
have a fairly extensive effective sub-class of - Ip which encompasses both V and W 
and which we are seeking to extend further. 

On the other hand, an approximation to P * is provided by the "linear" schemas 
which are described shortly. A recursive schema may be regarded as giving a way of 
computing the value of a non-basic function given the values of the same or other func¬ 
tions at other arguments. The determination of these latter values will in general re¬ 
quire further recursive calls, and so on. For example, in the following schema: 

f(x, y) - if P(>0 then f(x, S(y)) else 

if P(g(R(x))) then S(x) else H(g(R(x)), R(y)) 

g(x) = if P r (x) then H(g(S(x)), R(g(S(x)))) 

else if P(x) then f(x, x) else A 
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the evaluation of f(x, y) might need the value of f(x, S(y)) or the value of g(R(x)), 
and the evaluation of g(x) might need the value of g(S(x)) or f(x, x). But notice in 
this example, and this is the defining criterion < for a linear schema, that in any eval¬ 
uation at most one further value of a non-basic function is immediately required. 

Theorem 2 . If X is linear then X € p . 

Rather than give a proof of this theorem, we shall give here an example of the 
translation into program schema form of a simple linear program. The proof of Theorem 2 
however, is by reducing the general linear case to a schema isomorphic to this example. 

So the following translation or compilation may be regarded as the "canonical” example! 

L* f(x) « if P(x) then R(x) else S(f(T(x)), x) 

The second argument of S is an important feature, adding to the difficulty of this ex¬ 
ample. Let v rt , v,, ••• v be the values of the successive instances of f called for in 
some terminating evaluation. We observe that: 

v -R(T^(x>) 
mil 

v » M v r+ 1 » T^ r *(x)) for r - 0, m-1, 

and further,that if only we had some form of counter able to count up to integers less 

than or equal to m, then we could easily evaluate f by computing in turn 

v . v „, v-,. However, such a counter can be simulated when we realize that: 

nr m-1 f 0 

P x (T^ r ^ (x)) - false for r - 6, 1 

» true for r * m. 

An equivalent program schema to L is given in Figure 3. 

Discussion 

Of course the efficiency of the above translation leaves something to be desired, 
and we are paying for our restriction to only a finite number of locations by an increase 
in the computation time. The cause of the problem in this example is that we are re¬ 
quired to compute, in order, the sequence: 

T <m) (x), T (m " l) (x), *•', TT(x), T(x) 

which the schema L does by computing each term independently all the way up from x, and 
thus requires of the order of iti operations. To investigate this situation of trading 
off between time and space in more detail we have considered the following simple com¬ 
binatorial problem. 
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Program schema equivalent 
to a linear schema. 


Suppose we have always available the value X in a "read-only” location, M^, and we 
have in addition a further k locations, M^, •••, M^. For given fixed m, we want to 
write a shortest possible sequence of instructions of the form: 


M t : * T(M^) 0 < i * k, 0 * j £ k 

to compute successively the values 

T^(X), ••• TT(X), T(X). 

The values may appear in any location, but must be produced in the given order. Let 
L(k, m) be the length of the shortest such sequence of instructions. For example, 

L(l, m) = ■jm (mfl). We have derived an explicit formula for L(k, m) from which we 
can show 

T /i_ \ 1+1/k 

L(k, m)/%/m 

Thus there exists the possibility of computing linear recursive schematic functions 
using a fixed number of locations with reasonable efficiency. However, it is likely 
that some more versatile control structure than that of program schemas is required to 
realize this possibility. 
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We are currently investigating the relations between various augmented forms of 
program schema, such as program schemas with counters, push-down stores, or stacks. 

These augmentations may either be expressed by more complicated definitions of 'schema 1 
or else may be got from the ordinary program schema by fixing part of the interpretation. 
For example, to express the idea of counters we cpuld fix the interpretation of the con¬ 
stant symbol ZERO, function symbols ADDl, SUBl, and predicate symbol POSITIVE? in the 
obvious way. We must take care in applying some of the well-known results of automata 
theory to this area. For example, program schemas with counters are not 'universal' in 
this theory, because the first proof of Theorem 1 shows that no such schema can be equiva 
lent to the recursive schema V. However, we can show that under an appropriate defini¬ 
tion of push-down store program schemas, they are equivalent to the class of recursive 
schemas. 

A Class of Parallel Schemas 

For a very simple extension of recursive schemas we allow a parallel form of con¬ 
ditional expression. 

IF p THEN g ELSE r 

has the value g if p is true, the value r if p is false and in addition if p is undefined 
but g and r are both defined and are equal then the value is g. As a special case of thi 
connective we define: 

p /OR/ g to mean IF p THEN true ELSE g 

So if either p or g has the value true then (p/OR/g) is true, and if both p and g are 
undefined it is also undefined. Let J be the class of schemas we get by extending R. 
with /OR/. 

Theorem 3. & < Jt . 

The proof of this theorem which is too long and detailed to give here is by showing 
that no recursive schema can be equivalent to the following schema. 

S. f(x) * if P(x) then true else (f(L(x))/OR/f(R(x))) 

The value of Sj.(X) for a free interpretation I is true if there is at least one string 
of L's and R's for which P is true and is undefined otherwise.. In the proof we char¬ 
acterize the behavior of recursive schemas under the free interpretation where P is al¬ 
ways false. The only strings on the binary tree of (L, R)-strings that a recursive 
schema can "look at" are those strings within a bounded distance of a finite number of 
paths descending through the tree. Therefore, any recursive schema must fail to test 
certain strings and so will give a different value from S for some interpretations. 
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Conclusion 

We have a clear notion of effective schemes of computation involving uninterpreted 
functions, and should like to have a fairly natural augmentation of program schemas 
capable of representing any such effective computation. A good candidate for this su¬ 
preme position in the heirarchy of schemas would seem to be program schemas with two 
push-down stores. Provided the schema has the ability to put special control constants 
in its stores and to subsequently recognize them, the "universality" of this model ap¬ 
pears assured. 

Throughout this paper we have made the simplifying assumption that only total func¬ 
tions and predicates can appear in interpretations. Removing this assumption changes 
several of our results, and introduces new considerations. For example, to show that 
fit < J we have only to notice that the schema . 

f(x) = (P(x) /OR/ P*(x)) 

has no recursive equivalent. Also the concept of effective computation is now no longer 
unambiguous and depends on the conventions we adopt concerning the evaluation of partial 
basic functions. 
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